AMENDMENTS 



1 . (Currently amended) A document automatic classification system, comprising: 

list generation means for generating a word list for each of at least two categories by 

extracting words from a learning document set; 

unnecessary word determination means for relatively determining an unnecessary word for a 

category on the basis of a number of occurrences a frequency of appearance of a given word with in at 

least one e aeh other category by using the list generated by said list generation means; and 

means for generating a document classification catalog by eliminating words determined to 

be unnecessary words from each of the word lists. 

2. (Previously presented) The system according to Claim 1, wherein said list generation 
means generates a list indicating a frequency of appearance of a given word for each category. 

3. (Currently amended) The system according to Claim 1, wherein said unnecessary word 
determination means extracts a word belonging to a given category and determines it to be an 
unnecessary word in response to the word having a greater number of occurrences appearing mor e 
frequently in another category than is allowed by a given standard. 

4. (Previously presented) The system according to Claim 3, wherein the given standard is 
determined according to a predetermined threshold. 

5. (Previously presented) The system according to Claim 1, further comprising: 
document classification means for performing classification processing for classification 

target documents by using said document classification catalog. 

6. (Currently amended) A document automatic classification system, comprising: 

a classified document set storage device for storing documents classified according to at least 
two categories; 

a category table generation unit for generating a table, the table comprising: 



word lists corresponding to each of the at least two categories wherein the 
word lists are generated by extracting words from a learning document set; and 

frequencies comprising the a number of occurrences frequency of appearances 
of each extracted word within the learning document set; 
an unnecessary word elimination unit for eliminating an unnecessary word from a category in 
the table on the basis of a number of occurrences a frequency of appearance with in at least one e aeh 
other category of a given word , wherein said unnecessary word elimination unit extracts a word 
belonging to a given category and eliminates the word as an unnecessary word from said table in 
response to the word appearing more frequently in another category than is allowed by a given 
standard ; and 

a classification catalog storage device for storing the table from which the unnecessary word 
was eliminated by said unnecessary word elimination unit. 

7. (Original) The system according to Claim 6, further comprising: 

a classification target document storage device for storing classification target documents to 
be classified; and 

a document classification processing unit for performing classification processing for the 
classification target documents stored in said classification target document storage device by using 
said table stored in said classification catalog storage device. 

8. (Cancelled) 

9. (Previously presented) The system according to Claim 6, wherein said table contains 
information on each word, a frequency of appearance of each word, and a part of speech of each 
word. 

10. (Currently amended) An unnecessary word determination method in a document 
automatic classification system, comprising the steps of: 

generating a word list for each of at least two categories by extracting words from a learning 
document set, the word list containing information on a frequency of appearance of each extracted 



word within each category; 

determining an unnecessary word for a category on the basis of the a relative number of 
occurrences a frequency of appearance of a given word within at least one e aeh-other category; and 

eliminating words determined to be unnecessary words from each of the word lists. 

11. (Previously presented) The method according to Claim 10, wherein, in said step of 
determining the unnecessary word, the unnecessary word is determined according to whether one 
word selected from the given category appears in said other categories more frequently than is 
allowed by a given standard. 

12. (Previously presented) The method according to Claim 1 1 , wherein said given standard is 
a value obtained from a predetermined given threshold. 

13. (Original) The method according to Claim 1 1 , wherein said given standard is determined 
according to said frequency of the word in said other categories and a total frequency of all words in 
said other categories. 

14. (Currently amended) An unnecessary word determination method in a document 
automatic classification system, comprising the steps of: 

acquiring information on words from a document set, classifying the words according to 
category, and storing the words in a storage device; 

recognizing a number of occurrences frequency of appearance w ith in at least one e aeh-other 
category of a word belonging to a given category on the basis of the acquired information; 

determining whether the word is unnecessary for identifying the given category on the basis 
of the recognized frequency; and 

generating a document classification catalog by eliminating words determined to be 
unnecessary words. 

15. (Previously presented) The method according to Claim 14, further comprising 
storing said classification catalog into the storage device. 



16. (Previously presented) The method according to Claim 1 5 , further comprising the step of 
performing classification processing for classification target documents by using the classification 
catalog stored in said storage device. 



